On Versioning and Archiving Semantic Web Data
نویسندگان
چکیده
This paper concerns versioning services over Semantic Web (SW) repositories. We propose a novel storage index (based on partial orders), called POI, that exploits the fact that RDF Knowledge Bases (KBs) (a) have not a unique serialization (as it happens with texts) and (b) their versions are usually related by containment (⊆). We discuss the benefits and drawbacks of this approach in terms of storage space and efficiency both analytically and experimentally in comparison with the existing approaches (including the changebased approach). We report experimental results over synthetic data sets showing that POI offers notable space saving, e.g. compression ratio (i.e. uncompressed/compressed size) ranges between 1,800% and 18,163%, as well as efficiency in various cross version operations. POI is equipped with three version insertion algorithms and could be also exploited in cases where the set of KBs does not fit in main memory. Although the focus of this work is SW data versioning, POI can be considered as a generic indexing scheme for storing set-valued data.
منابع مشابه
An eGovernment System for Temporal- and Semantic-Aware Access to Norms
In this paper, we present the results of an ongoing research involving the design and implementation, in an eGovernment scenario, of a semantic-aware system supporting efficient and personalized access to a multi-version repository of normative texts. The research activity is entitled “Semantic web techniques for the management of digital identity and the access to norms”. In the context of a c...
متن کاملOntology versioning on the Semantic Web
Ontologies are often seen as basic building blocks for the Semantic Web, as they provide a reusable piece of knowledge about a specific domain. However, those pieces of knowledge are not static, but evolve over time. Domain changes, adaptations to different tasks, or changes in the conceptualization require modifications of the ontology. The evolution of ontologies causes operability problems, ...
متن کاملTracking Changes During Ontology Evolution
As ontology development becomes a collaborative process, developers face the problem of maintaining versions of ontologies akin to maintaining versions of software code or versions of documents in large projects. Traditional versioning systems enable users to compare versions, examine changes, and accept or reject changes. However, while versioning systems treat software code and text documents...
متن کاملWikicrawl: reusing semantic web data in authoring Wikipedia
This paper presents the main part of a project conducted at the University of Warwick regarding a tool for retrieving semantic web data and reusing the retrieved data in authoring Wikipedia pages. The goal of this tool is to enable semantic web crawling with a user friendly interface by applying a semantic web framework API to an existing web archiving system and an easy way for reusing the dat...
متن کاملPersonalized access to multi-version XML documents in an eGovernment scenario
In this paper, we present some results of an ongoing research involving the design and implementation, in an eGovernment scenario, of a multiversion repository of norm texts supporting efficient and personalized access. In particular we defined a multi-version XML data model supporting both temporal versioning –essential in normative systems– and semantic versioning. Semantic versioning is base...
متن کامل